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Title 

Shape matching method for indexing and retrieving multimedia data. 

Field of the Invention 

5 The invention relates to a method for indexing and retrieving multimedia data. More 
particularly, the invention relates to calculating a similarity index between at least two 
sets of multimedia information based on shape feature information extracted from 
multimedia data objects. 

10 Background to the Invention 

Large image and video sequence databases are used in a number of multimedia 
applications in fields such as entertainment, business, art, engineering, and science. 
Retrieving images or parts of sequences based on their content, has become an 
important operation. 

15 

Shape analysis methods play an important role in systems for object recognition, 
matching, registration, and analysis. However, retrieval by shape is still considered to be 
one of the most difficult aspects of content-based search. 

20 The key to multimedia data retrieval is the following: the types of features of the 
multimedia data to be considered and how to express these features and how to 
compare between features. 

A common problem in shape analysis research is how to judge the quality of a shape 
25 description/matching method. Not all methods are appropriate for all kinds of shapes 
and every type of application. Generally, a useful shape analysis scheme should satisfy 
the following conditions: 

• Robustness to transformations - the result of analysis must be invariant to translation, 
30 rotation, and scaling, as well as the starting point used in defining the boundary 
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sequence; this is required because these transformations, by definition, do not change 
the shape of the object, 

• Feature extraction efficiency - feature vectors (descriptors) should be computed 
efficiently, 

5 • Feature matching efficiency - since matching is typically performed on-line, the 
distance metric must require a very small computational cost, 

• Robustness to deformations - the result of analysis must be robust to spatial noise, 
introduced by a segmentation process or due to small shape deformations, 

• Correspond to human judgement - a shape similarity/dissimilarity measure should 
10. correspond as much as possible to a human Is judgement. 



Known methods of representing shapes include the descriptors adopted by MPEG-7; 
Zernike moments [A. Khotanzan and Y.H.Hong. Invariant image recognition by zernike 
moments. IEEE Trans. PAMI, 12:489-497, 1990] and CSS [Farzm Moklitarian, Sadegh 

15 Abbasi and Josef Kittler. Robust and Efficient Shape Indexing through Curvature Scale 
Space. British Machine Vision Conference, 1996]. For the Zernike moment shape 
descriptor, Zernike basis functions are defined for a variety of shapes in order to 
investigate the shape of an object within an image. Then, an image of fixed size is 
projected over the basis functions, and the resultant values are used as the shape 

20 descriptors. For the curvature scale space descriptor, the contour of an object is 

extracted and changes of curvature points along the contour are expressed in a scaled 
space. Then, the locations with respect to the peak values are expressed as a z- 
dimensional vector. 



25 The Zernike moments and CSS descriptors have obvious advantages such as very fast 
feature matching and compact representation. Unfortunately, the majority of the 
compact shape descriptors (i.e. Zemike moments) are not robust to shape deformations. 
Others, like CSS, are robust but matching of such descriptors results in many false 
positives. The retrieval accuracy of the CSS method can be sometimes poor, especially 

30 for curves, which have a small number of concavities or convexities. In particular, this 
representation cannot distinguish between various convex curves. Another disadvantage 
of the compact descriptors is that their extraction is usually computationally expensive. 
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Although it is not a problem for creating databases (feature extraction is performed off- 
line), this makes it difficult (or even impossible) to use them for fast on-line comparison 
of two shapes provided as binary masks. 

5 Object of the Invention 

It is an object of the invention to provide a computationally efficient shape 
description/matching method for fast matching of two shapes provided as binary masks. 
It is a further object of the invention that this method be robust to shape deformations, 
and does not result in many false positive returns. It is yet a further objective that the 
10 technique of the present invention may be implemented in an on-line environment. 

Summary of the Invention 

Accordingly the present invention provides an improved technique for effecting 
15 a measaurement of the dissimilarity of two shapes. In accordance with a preferred 
embodiment a method of comparing at least two sets of multimedia data using shape 
information so as to provide a dissimilarity measure between the sets is provided, a first 
set forming a reference set and the second set being compared relative to the first set, the 
method comprising steps of; 
20 providing a set of contour points for each set of multimedia data, 

associating the contour points of the first set with contour points of the second 
set so as to define associated contour points, 

calculating distances between said associated contour points, and 
wherein the distances between said associated contour points are used to determine a 
25 dissimilarity measure between said sets of multimedia data, thereby providing an 
indicator of the degree of similarity between the sets. 

The set of contour points for each set of multimedia data is desirably obtained by 
tracing the boundary of each set of multimedia data. 



30 
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The sets of contour points may be down-sampled to yield an equal number of 
approximately equally spaced apart contour points in each set. 

The sets of contour points are desirably translation and scale-size normalised 
prior to association with one another. 

The method may include the additional step of establishing starting points for 
each set of contour points, the starting points being established by defining all contour 
points for each set whose distances from a calculated centroid point of their respective 
contour are greater than a user-defined first percentage PI of the most distant contour 
point from their respective centroid. Hie user may be enabled to input the first 
percentage PI. 

Starting points are desirably established in a contiguous manner. The contiguous 
starting points are typically split into groups which contain less than a user-defined 
second percentage P2 of the total number of contour points. The user may be enabled to 
input the second percentage P2. 

Each group of starting points is typically represented by the starting point which 
is most distant from the centroid. 

A reference starting point is desirably taken as the most distant starting point on 
the set of contour points with the largest number of starting points, or the largest initial 
area, and for each starting point on the set to be compared, the set to be compared is 
rotated to align its starting point with the reference starting point. 

Associated contour points are desirably associated with one another by providing 
a control point for each of the sets of contour points, and subsequently moving the 
control points sequentially along the contour of each set, and wherein the movement of 
the control points about the contour is used to associate contour points from each set 
with one another. 
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The movement of the control points for each set is preferably initiated at aligned 
starting points of each set. 

At each step of movement of the control points about the contour, the distance 
between the two control points, the distance between the control point on the first set 
and the next contour point on the second set, and the distance between the control point 
on the second set and the next contour point on the first set is typically calculated, and 
the contour points with the smallest of these calculated distances are associated with 
each other. 

The contour points are usually assigned until the control points have moved 
about all of the contour points. 

The dissimilarity measure may be determined by effecting a measure of a scale- 
size normalised circumference of each set, determining the minimum of said 
circumferences, and further determining the average and the standard deviation of the 
distance between associated contour points. 

The dissimilarity measure is typically provided by the equation: 



where: 

D(A,B) is the dissimilarity degree between set A and set B 

c denotes a user-defined constant which rescales values of the dissimilarity degree to 
a convenient range of values, 

cir mlTi is the minimum circumference 

|Z5j is the average distance between associated contour points, and 

\V\ is the standard deviation of the distance between associated contour points. 

In a preferred embodiment, a plurality of dissimilarity measures are calculated, each 
dissimilarity measure being associated with a specific alignment of the sets being 
compared, and wherein a final dissimilarity measure providing an overall indication of a 
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level of equivalence between the sets is given as the lowest of all the calculated 
dissimilarity measures. 

These and other features of the present invention will now be described with reference 
to the following drawings. 

Brief Description of the Drawings 

Some embodiments of the invention are hereinafter described with reference to the 
accompanying drawings wherein: 

Fig. 1 shows the main steps of a shape analysis algorithm according to the present 
invention; 

Fig. 2 is an example of contour tracing as implemented according to an aspect of the 
present invention; 

Fig. 3 shows an 8-neighbourhood as utilised in a method step of the present invention; 

Fig. 4 describes contour tracing steps such as those used in the present invention; 

Fig. 5 shows how contour points are chosen according to the present invention; 

Fig. 6 shows areas used to calculate the moments of the contour according to the present 

invention; 

Fig. 7 gives an example of translation and size normalisation using MPEG-7 test 
images; 

Fig. 8 gives an example of rotation compensation using MPEG-7 test images; 
Fig. 9 gives an example of starting point extraction using an MPEG-7 test image; 
Fig. 10 gives an example of assigning the contour points of two shapes; by simple 
assignment, and by the present invention; 

Fig. 11 shows a sample step of a contour point assignment according to the invention. 



Detailed Description of the Drawings 

According to the present invention a method of comparing at least two sets of 
multimedia data using shape information to provide a dissimilarity degree is provided. 
Within the present invention the concept of how to express features may be considered 
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as a shape description whereas the comparsion of features may be considered as shape 
matching. A dissimilarity degree or measure provides an indicator of the similarity 
between a first set of data and a second set which is tested against the first set. The sets 
of multimedia data are typically provided in the form of binary images or contours. The 
5 methodology of the present invention is typically implemented in a software sequence 
which is operational in two main stages: feature extraction 1 and feature matching 2. 
Figure 1 shows a flow sequence according to the invention adapted to provide for such 
stages. 

10 In the first stage, the boundaries of the shapes from the input binary images are traced. 
The extracted contours are translation and scale-size normalized, and the most likely 
starting points for both shapes are estimated. 

In the second stage the dissimilarity measure is calculated using features extracted in the 
15 first stage. The starting points from both shapes are assigned into pairs. Before 

evaluating the dissimilarity measure for a given pair of starting points, the rotation 
between these points is estimated and one of the shapes is rotated. The dissimilarity 
between two shapes is calculated for each pair of starting points. The lowest overall 
dissimilarity is taken as the dissimilarity degree measure between two shapes. 

20 

The sequence flow can be further separated into a series of individual steps which are 
illustrated below as forming a sequential operation. : 

Contour tracing (Step 100) 

25 In the contour tracing step, an ordered sequence of the boundary pixels is extracted. —To 
extract such a set, a technique such as contour tracing (which may also be known to the 
person skilled in the art as border following or boundary following) is applied to the 
input binary images. The application of such a tracing algorithm ignores any "holes" 
present in the shape. For example, with reference to Figure 2, if a shape like that of an 

30 "A" 3 is to be traced, the contour traced by the algorithm will be similar to the external 
(grey) pixels of the "A" 4. 
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To explain the tracing algorithm, the concept of the 8-neighborhood [T. Pavlidis, 
Algorithms for Graphics and Image Processing, Computer Science Press, Rockville, 
Maryland, 1982] of a pixel first needs to be defined. As shown in Figure 3, the 8- 
5 neighborhood 5 of a pixel c 6, is the set of 8 pixels, termed pixels cl, c2, c3, c4, c5, c6, 
c7 and c8, which share a vertex or edge with that pixel. 

The tracing algorithm operates as illustrated in Figure 4: Given a digital shape (a group 
of white pixels on a background of black pixels), a white pixel is located and declared as 
10 a starting contour pixel 6. Locating a starting pixel can be performed in a number of 
ways. Using the assumption that the contour is traced in a clockwise direction, the 
starting pixel has to satisfy the restriction that the left adjacent pixel is not white. Now, 
its coordinates are stored and the starting pixel is set as the current contour pixel. 

15 From the current contour pixel, a search is performed in its neighborhood for the next 
contour pixel. To perform this search the current direction 8 is very important. It can be 
defined as a line connecting the previous contour pixel with the current contour pixel. 
The current direction is used to determine from which pixel in the current pixel 
neighborhood one should start to search for the next contour pixel. Mote that the current 

20 direction does not change during the current pixel neighborhood search. It changes only 
when the next contour pixel is made the current contour pixel (one step forward in the 
contour). When the tracing begins, the current pixel is also the starting pixel. In this case 
the current direction is defined as the direction connecting the starting pixel with the 
pixel on top of it 7. This direction can be assumed because of the restriction that was 

25 made in choosing the starting pixel . 

If the current contour pixel and previous contour pixel only share a vertex (this can be 
determined directly from the current direction), the searching starts from the pixel 9 
which lies 90 degrees to the left of the current direction. In the worse case, the 6 shaded 
30 pixels 10 have to be checked. In the presented example the search finishes at the fourth 
pixel 11. 
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If the current contour pixel and previous contour pixel share an edge, the search starts 
from the pixel 12, which lies 45 degrees to the left of the current direction. The pixel on 
the left of the current contour pixel does not have to be checked, because it was already 
checked when the previous contour pixel neighborhood was searched. An example is 
5 shown in Figure 4(d). This time, in the worse case, the 7 shaded pixels 13 in the current 
contour pixel neighborhood have to be examined. In the example, searching finishes at 
the second pixel 14. 

When the next white pixel from the tracked contour is found, the algorithm moves to 
10 this pixel, making it the current contour pixel. Its coordinates are stored, the current 
direction is updated and its neighborhood is examined as explained above. This 
procedure continues until the starting pixel is visited again. 

The proposed tracing algorithm can be easily and efficiently implemented. Modulus 8 
15 operations or look-up tables can be used to determine which pixels in the current pixel 
neighborhood should be checked, and in which order. It will be appreciated that the 
above described is a preferred tracing algorithm but that any suitable alternative could 
also be used in the context of the present invention .It is convenient to determine and 
store the distances between contour pixels during the contour tracing, since the length of 
20 the whole contour as well as its separate segments has to be known at the next stage of 
the shape analysis algorithm. It can be performed in a very efficient manner using a 
concept known in literature as Freeman code or chain code (see: Freeman H. Computer 
processing of Line-Drawing Images, Computing Surveys Vol6, No 1, March 1974). 

25 A chain code describes the boundary in terms of the directions of the line segments 
connecting a chain of pixels. The representation can be based on 4- or 8-connectivity . 
The direction of each segment is encoded using the numbering scheme illustrated in 
Figure 3. So, not only the vectors with coordinates of the contour pixels will be stored 
after contour tracing, but also the vectors of directions between pixels. 



Using the extracted contours, the technique then applies a normalization step to each of 
the two images to be compared (Step 105). This normalization step typically includes 
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two sub-steps: firstly the extracted contours are downsampled and then the 
downsampled contours are translated and normalised with respect to one another . 

Contour Down-sampling (Step 105b) 

5 

From the previous stage it will be understood that each set of data is now provided or 
represented as an ordered sequence (x,y) of the contour pixels and a chain code is also 
available. The purpose of this stage is to extract a vector of the contour points from the 
set of contour pixels, which will be equally distributed along the curve. 

10 

This stage is required because the similarity degree measure used in the shape analysis 
scheme of the present invention is based on distances between corresponding shape 
control points along the shape curve. Arising from this, an approximately equal number 
of contour points for both compared shapes is required (see Figure 1). It will be apparant 

15 that the number of contour pixels in the shape can vary significantly due to scale 

changes. Another problem is related to the fact that the vertical and horizontal segments 
between adjacent contour pixels each have unit length, while the diagonal segments 
contribute V2 . Because of this fact, the subjective number of contour pixels in different 
parts of the contour also changes after rotation. The last reason for introducing the 

20 control points is the computational cost of the algorithm, which depends on the number 
of points representing the contour. Some of the region^ contours obtained from a typical 
CIF sequence (352 x 288 pixels) can have more than 1000 pixels. Experiments show, 
that in most cases, 1 00 points is enough to represent the contour in order to preserve the 
most significant contour features. 

25 

There are many heuristic approaches for approximating contour curves using a small 
number of vertices or segments. The general approach is splitting or merging contour 
segments until the chosen approximation error is achieved. In the proposed down- 
sampling algorithm the main aim is to extract contour points in such a way that they will 
30 be almost equally distributed along the contour, i.e. from the whole set of contour pixels 
a subset of pixels which are approximately equally distant will be chosen. This can be 
easily performed during one scan of the contour pixel sequence. 
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The maximum number of contour points for the shape is determined by its number of 
contour pixels. Since both shapes must have the same number of contour points, the 
maximum number of contour points for both shapes is defined by the number of contour 
5 pixels in the shape which has the smaller number of contour pixels. If the shape 

descriptors are stored in a database the number of contour points should be fixed for all 
shapes in this database. 

According to the techniques of the present invention a selection of contour points is 
10 provided in a very simple and efficient manner. The idea is to move along the contour 
choosing the pixels for contour points. The position of the pixel along the contour can 
be efficiently calculated from the chain code. 

As illustrated in Figure 5(b), only pixels which have the closest position in the contour 
15 length to the "ideal" position of the contour points are chosen. The coordinates of the 
chosen pixels are copied to contour point vectors (one for the x and one for the y 
coordinate). 

From this point on, the contour of the shape is represented by the ordered sequence 
20 (x(s) 9 y(s)) of the contour points (vertices), where s denotes the position in the contour 
and can have integer values from 0 to N , where N denotes the total number of contour 
points used to represent the shape. 

Translation and Scale Normalization (Step 105c) 

25 

Clearly, moving a region from one location in the image to another and/or changing the 
size of it should not affect the shape analysis. Since the center of the shape is invariant 
to translation, rotation and scaling, the present invention utilises a location of the 
centroid of the shape as part of the method of solving the translation problem. In the 
30 presented approach, the centroid of the object curve is translated to coordinates (0,0) , 
which make the object curve translation invariant. Scaling invariance may be obtained 
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by normalizing the shape curve by shape area (see for example: Dinggang Shen, Horace 
H.S., Discriminative wavelet shape descriptors for recognition of 2-D patterns, Pattern 
Recognition 32, pp 151-156, 1999). This translation and scaling invariance can be 
achieved using the regular moments as described in the following subsection. 



Calculation of The Moments 

The definition for a regular moment m M is: 

m M " \\x"y q nx,yydxdy (1) 
For binary images, the above equation can be expressed in a discrete form as follows: 

R 

In the presented scheme all low order moments needed for shape normalization are 
derived from the boundary contour points. This approach is much faster in comparison 
to calculating moments from the above definition equations. For calculating moments 
from the boundary, a similar method to the one shown in "Herbert Freeman. Computer 
processing of Line-Drawing Images, Computing Surveys, Vol 6, No 1, March 1974" for 
deriving the area of the shape from a simple chain code may be used. Here this method 
is extended for moments m 0l and m l0 . 

With reference to Figure 6(a), the algorithm moves from one contour point to another in 
a clockwise direction 15 around the contour. The moments are calculated as a sum of 
appropriate moments of trapezoids created from the current and previous contour point 
and the *(or y) axis .[for moment m' 0l trapezoids are created from the current, previous 
and y axis] An example of such a trapezoid 17 is shown in Figure 6(b) for and m' 01 
only .The vertices of the trapezoid are (x(c),j>(c)) , (*(p),y(p)) , (*0>),y(0)) , and (x(c),0) 
where c and p denote current and previous contour point respectively. 

From equation 2 the formulas for the low order moments of trapezoid (m' m , m{ 0 , m' 0l ) as 
functions of its vertices can be easily derived: 
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13 

-|6<p)-*(c)H>(c)+jfCp)], < 3 > 

= i^(p)-x(c)].[x(c) + x(/7)].[y(c)+y(p)], (4) 
4 

« =4[y(^)-3'(p)][ x ( c ) +;c ^>]t>'( c > + >' (p) l' (5) 



10 



4" 

Note that m^, and m{ 0 have negative values for vertices from the upper border of the 
shape (subtractive component 18) and positive values for vertices from lower border 
(additive component 19). Similarly m' 0l has negative values for the left part of the shape 
and positive for the right part (not shown in the figure). 

Low order moments of the whole shape can be calculated as a sum of moments of all 
trapezoids as follows: 

where V denotes the set of all vertices (contour points) of the shape. 

The computation of all nesseccary moments (m' m , m[ Q , m 0 i) of the sha P e takes time 
linear to the number of vertices. Equations 3,4 and 5 can be implemented in a very 
efficient manner since they have many common components (sums). 



Normalization 

Using low order moments, the coordinates of the center of the shape can be calculated as 
follows: 

Xe= ^,y c= Ht2L (7) 

m 0Q m O0 

where x c and y c are the coordinates of the centroid of the shape. 

Scaling invariance can be obtained by rescaling the coordinates of the shapes vertices by 
the central moment . The scaling factor of the present object size, compared with the 
expected size, is: 
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(8) 

AREA 



where AREA is a chosen constant - the same for all compared shapes. 

Finally, the translation and scale normalized shape can be obtained by changing the 
5 vertices coordinates according to the following transformation: 

v * *fr)-*c * M - y&t2s (9) 

x (s) = , y (s) = \?J 

a a 

An example of translation and size normalization of the shapes is shown in Figure 7. 
10 The original shape, from the MPEG-7 test set, is shown in Figure 7(a), while Figure 7(b) 
shows a rescaled (5 times smaller) and translated version of the same shape. Finally, 
Figure 7(c) shows extracted vertices of the above contours, and Figure 7(d) shows 
vertices after translation and scale normalization. 

15 For the rest of this description we let the sequence of (x(s), y(s» represent the translation 
and scale normalized contour. 



Extraction of the starting points (Step 110) 

20 It will be appreciated that a rotation of the object region or changing the point from 
which the tracing of the boundary of the region started should not affect the shape 
analysis. One possible solution to this problem could be to evaluate 
similarity /dissimilarity measure for every pair of contour points. 

25 In the presented approach, the set of the most likely starting points is chosen from the 
shape to speed up the search. The chosen set depends mainly on the shape and it is 
assumed that only some of the starting points from the whole set will be affected by its 
deformations. The number of chosen starting points can be regulated by two parameters. 
More than one set of starting points can be stored for one shape in the database to allow 

30 the user to choose the speed and accuracy of the search for the optimal match. 
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According to a preferred implementation of the present invention, the following scheme 
for choosing starting points is used: 

1. Find the most distant contour point from the centroid (distance from 
the centroid), 

2. The threshold distance T d is defined. The contour points which are 
equally or more distant from the centroid than T d are marked as potential starting 
points. The threshold distance T d can be expressed as a fraction of the maximum 
distance: 

T^ Pa d max (10) 
where parameter p d is a real number in the range (0,l) . 

3 . For each continuous group of marked contour points, one is chosen as the 
starting point for this part of the contour. Experiments show that the most distant 
contour point in the group is a good choice. To avoid a situation where too many 
contour points will be represented by just one starting point, a maximum number 
of marked points T„ is introduced, which can be represented by one starting 
point: 

T m -t Pm .N-\ (ID 

where N denotes the number of all contour points representing the shape and p„ 
denotes a number within the range (0 j) . 

If the group has more points than T rt , it is divided into smaller groups which 
have equal or less points than T n . Each group is then represented by one starting 
point (the one most distant from the centroid). 



The intensity of the search for the optimal rotation between two shapes can be varied by 
Pd and Pn . It is possible by changing parameters Pd and p R to perform fast, normal or 



WO 2004/088538 



PCT/IE2004/000042 



16 

extensive search for optimal rotation alignment. Although the search is performed at a 
later step, the intensity of the search depends on how many starting points are chosen at 
this stage- As such it can be a key contributing factor to the overall efficiency of the 
process. However, in most cases good results can be achieved by choosing the values as 

5 p d = 0.7 and p n = . They lead to around 2 to 4 starting points for elongated shapes 

and to around 20 for circular shapes. An example of an extracted starting point set is 
shown in Figure 9. 

Matching (Step 120) 

10 

In this stage the search for the best rotation alignent between two shapes is performed. 
The extracted set of starting points are used to narrow down the search space. For each 
checked rotation the similarity/dissimilarity measure is calculated. The lowest 
dissimilarity measure obtained during this search is taken as the final dissimilarity 
15 degree between two shapes. The detailed description of calculating a dissimilarity 
measure for given set of starting points is explained in the next subsection. 

In the presented technique the preferred search operates as following: 

20 1 . A set of starting point pairs from both shapes for which the 

similarity /dissimilarity measure will be evaluated is created. In the current 
implementation the pairs are combined in the following way: (sp ref ,sp x ) , 

(sprcf > s Pz) >— ( s Pref * S P * ) > where sp ref is the most distant starting point from the 
shape chosen as the reference and K is the total number of starting points from 

25 the shape chosen as the rotated one. Usually it is better to choose the shape with 

a smaller number of starting points to be the one rotated, and the shape with 
more starting points to be the reference shape (this decision is application- 
oriented). If both shapes have the same number of starting points, the shape with 
a smaller initial area (before normalization) is chosen for rotation and that with a 

30 bigger area as the reference shape. 
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2. The rotation for each pair of the starting points (see example in the Figure 
8) is compensated and the dissimilarity is evaluated. The pair with the lowest 
dissimilarity measure are selected (let it be (sp ref >sp x )). 

3. The dissimilarity for pairs of sp rcf with starting points from the 

5 neighbourhood of sp x are evaluated as in the step 2. $p x the neighbourhood is 

searched and dissimilarity measures for pairs (...(sp r<r/ jsp^) > (fPrcf >sPx-0 » 
(sp ref ,sp x+l ) AsPref > s Px+2) •») are calculated. The search is continued in both 
directions until the dissimilarity values start to increase 

10 4. The lowest dissimilarity measure obtained from all checked starting point 

pairs is taken as the final dissimmilarity measure between the shapes. 

The rotation between starting points can be compensated as illustrated in Figure 8. The 
subjective rotation between starting points from both shapes can be estimated as the 
15 angle a between these starting points and the aligned centroids of these shapes. The 
rotation is compensated by rotating one of the shapes around angle a . The new 
coordinates of the contour points for the rotated shape can be calculated from the 
following equations: 

20 x \s) = x(s) cos a -y(s) sin a (12) 

y ' (s) = x(s) sin a + y (s) cos a (13) 

Similarity/Dissimilarity Measure 

25 TTie presented similarity /dissimilarity metric is based on the distances between the 

contour points of two shapes. Before calculating the similarity/dissimilarity between two 
shapes, the correspondence problem between the contour points from both shapes must 
first be solved. The simple assignment of the contour points could be based on their 
indexes (staring from the starting points), but it is not robust to shape deformations 

30 (especially contour noise). An example of such simple assigment is shown in the Figure 
10. Figures 10(a) and (b) show the shapes for which the degree of dissimilarity has to be 
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calculated. Both shapes are identical except that the top of the shape shown in Figure 
10(b) is corrupted by segmentation noise. Figure 10(c) demonstrates a simple contour 
points assignment. From this example it is obvious that the above assignment of contour 
points fails. In the presented method a semi-optimal solution to this problem is 
proposed. It makes use of two control points moving along the shape. It is deformation 
and noise robust and yet the complexity is 0(N) . 

Imagine that two control points cp A and cp B are moving along the contours A and B , 
beginning from aligned starting points. They move along contours in N % steps and for 
each step the distance between them is obtained. Each step consists of two stages. 
During the first stage, each control point moves one position forward in the contour 
point sequence. For the new position, the euclidean distance \d c (i)\ ^between the 

coordinates of these control points is calculated. During the second stage, the distances 
between control points and next contour points in another contour ( \d AB (f)\ and 1^(01) 
are calculated. If one of these is smaller than \d c (i)\ the appropriate control point moves 
one position forward to minimize the distance between cp A and cp B . The distance \d(i)\ 
for the step i is obtained as a minimum of the distances |rf c (i)| , \^ab(0\ and (d^O*)! . 

In this way control points move together along the contours as closely as possible. 
Usually they move only one position forward during one step, but one of them can move 
two positions if it produces a smaller distance between them. In other words, in some 
parts of the contours the control point from shape A can move faster whereas for 
another parts the control point from shape B can move faster. In this case faster means 
two positions in the contour point sequence per one step. Note that in such a way some 
contour points are skipped. An example of a step while skipping one contour point is 
shown in Figure 11(b). The distance |rf^(/)| is shorter than \d c (i)\ so the control point 
cp B in the shape B moves one further step forward, and (^(Ol is taken as the distance 
for this step. 

Figure 10(d) shows the manner in which the contour points in shapes shown in Figures 
10(a) and (b) were assigned by the moving control points. The improvement (comparing 
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with Figure 10(c)) is visible by the decreased complexity of the lines showing point 
assignments, especially for parts {2} and {4} of the shapes. 

The distances between the contour points assigned in this way are used to calculate the 
5 similarity/dissimilarity between two shapes. 

When walking simultaneously along two contours the measured distances between 
contour points from both shapes should be relatively small. Moreover, they should not 
change along the curves. If the shapes are similar, the standard deviation of these 
10 distances should be small and strongly dependent on the degree of dissimilarity between 
the shapes. The proposed metric takes advantage of this facts. It is based on both: the 
geometrical distances between assigned contour points and their variations along both 
contours. 

15 The average distance between contours can be calculated from the following equation: 

where |rf,| denotes the euclidian distance between the coordinates of the contour points 
assigned in step i. 

20 Hie standard deviation of the distances can be calculated from the following equation: 
The proposed dissimilarity degree measure can be defined as: 



D(A,B) = -±-.\D\.\V\ 

mm 

where: 

• c denotes the chosen constant which rescales values of the dissimilarity 
degree metric to a convenient range of values (calibration of the algorithm); 
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• cir mln is the minimum circumference of the circumferences of both 
normalized shapes. 

cir min adjusts the dissimilarity metric to human perception of dissimilarity of two shapes 
depending on the complexity of the shapes. It has been observed by the present 
inventors that humans perceive even small differences between two simple shapes as 
significant, in contrast with the differences between two more complicated shapes for 
which small differences are not so important. Assuming, that the shape is size 
normalized the circumference can be used as a measure of the complexity of the shape. 

Such a dissimilarity degree measure gives values close to zero for very similar shapes 
and values one and greater then one for significantly different shapes. Based on this the 
percentage similarity degree measure can be defined as: 



It gives values from 0% to 100% (values close to 0% for significantly different shapes 
and close to 100% for similar shapes). 

It will be appreciated that a method of providing a dissimilarity measure between two 
sets of data has been described. Such a method according to the present invention 
enables a real time comparison between a first or reference set of data and a second set 
which is compared against the first. The use of such techniques is advantageous for 
retrieval performance when used for searching databases for images having similar 
shapes to a query image . The presented method outperforms all existing shape matching 
algorithms in the MPEG7 Core Experiment CE-Shape-1 (part B). Compared to existing 
methods, the proposed shape analysis technique is very fast for on-line similarity 
calculations between shapes taken directly from binary images. 
.By implementing a technique according to the present invention it is possible to 
efficiently develop applications which require a large number of comparisons between 
images of binary shapes. One example is advanced image databases. When determining 
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the similarity between images a number of features can be used such as colour and 
texture. However, another important feature is the shape of the objects present in the 
scene. As such, this approach is an important tool when indexing large archives of 
images for browse and search and retrieval. Furthermore, due to its efficiency it could be 
5 successfully applied to video clips and find application in shape-based retreival from 
video archives. Current producers of video managment tools typically use only use 
features based on a temporal segmentation of the video and extracted keyframes. 
However, the next generation of these systems will require more sophisticated features 
and the proposed approach is an ideal candidate for this. Another example application 
10 where similar technology is already being used is trademark logo databases. When a 
new trademark or logo is designed, its shape characteristics canbe used to determine 
whether or not it is too similar to existing trademarks or logos .Using techniques such as 
provided by the present invention enable a measurement or dissimilarity value to be 
calculated so as to enable empirical comparisons. 

15 

It will be appreciated that the present invention has been described with reference to a 
preferred embodiment thereof and that any sequence of steps outlined above are 
illustrative of the techniques of the present invention and that it is not intended to limit 
the application to any one set of steps of sequence thereof except as may be deemed 
20 necessary in the light of the appended claims. 



